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Abstract 

The aim of this paper is to analyze the weighted KyFan inequality proposed in m- A 
number of numerical simulations involving the exponential weighted function is given. We 
show that in several cases and types of examples one can imply an improvement of the 
standard KyFan inequality. 
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1 Introduction. The weighted Ky Fan inequality 

1 . 1 . The well-known Ky Fan inequality PEUE] asserts that log det C is a concave function of 
a (strictly) positive definite matrix C. In other words, V strictly positive-definite d x d matrices 
Cl, C2 and Ai, A2 > 0 with Ai -h A2 = 1 , 

logdet^AiCi -h A 2 C 2 ^ — Aftlogdet Ca > 0; equality iff A 1 A 2 = 0 or Ci = C 2 . (1.1) 

a=l,2 

For original ‘geometric^ proofs of (HH) and other related inequalities, see Ref m and the bib¬ 
liography therein. In the derivation of (HU) occupies few lines and is based on prop¬ 

erties of information-theoretical entropies; a similar method allows to derive a number of other 
determinant-related inequalities. 

*Math Dept, Penn State University, PA 16802, USA; DPMMS, University of Cambridge, CB30WB, UK; IITP, 
RAS, 127994 Moscow GSP-4, Russia. E-mail: imsl4@ps.edu, yms@statslab.cam.ac.uk 

^Department of Statistics, Federal University of Sao Carlos (UFSCar), Sao Carlos, Brazil. E-mail: 
sa_ yasaei@yahoo.com 


1 







More precisely, dLH) is equivalent to the bound for Shannon differential entropies for Gaussian 
probability density functions (PDFs): 


^(/fiCi+AaCa) “ ^i^(/c°) “ > 0; equality iff A 1 A 2 = 0 of Ci = C 2 . (1.2) 


Here and below, stands for the d-variate normal PDF N(0, C), with mean 0 and covariance 
matrix C: 




:= 




(1.3) 


\^n J 


Next, h{f) = - [ /(xf) log /(xf)dxf represents the Shannon differential entropy of a PDF /. 

JM.'i 

In the Gaussian case, 


Hfc") ■■=- f /c°(xf log /''°C(xf)dxf = ^ log [(27r)^(det C)' 

.md Z L 


+ 


dlog e 


(1.4) 


Inequality (II.2p is a consequence of the fact that, under certain conditions, h{f) is maximized / = 
/q°. Throughout the paper, we use the abbreviation KFI for either of (II.1|) . (II.2|) . (Sometimes 
the term a standard KFI is also used.) 


1.2. In this paper we compare inequalities dni), (m with weighted inequalities similar 
to (jl.2p and established for weighted differential entropies in the recent paper [TT]; see below. 
For the sake of pre-emptiveness, we call each of these inequalities a weighted Ky Fan inequality 
(WKFI, for short). A WKFI is obtained for a given non-negative weight function; when this 
function equals 1, the WKFI coincides with the KFI. A natural question is whether a WKFI 
can provide an ‘improvement^ to KFI; for instance, by producing a positive lower bound for the 
LHS in (II.ip . (II.2p . We give a numerical evidence that the answer can be yes or no, depending 
on the choice of and A^. We work with so-called exponential weight functions for which all 
calculations simplify. Furthermore, the numerical simulations are done for d = 2, allowing a 
graphical representation of results. 

Let xf € 1 -^ <?^(xi) > 0 be a given non-negative measurable function positive on an open 

domain in M^. Following and HU, under the usual agreement, 0 • log 0 = 0- log (+oc) = 0, 

the weighted differential entropy (WDE) of PDF / with weight funetion (WF) 0 is defined by 

Kif) ■■=- [ <^(xf)/(xf)log /(xf)dxf, (1.5) 

jRd 

assuming that the integral is absolutely convergent. Cf. [DEli. 
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In the Gaussian case, the WDE admits a representation extending the RHS in fll.41) . 

Define a number (a(C) = (^^(C) > 0 and a d x d matrix involving WF (j) and PDF 

fNo. 

JC • 


«(C) = f <^(xf)/g°(xf)dxf, 




<^(xf)/No(xf)dxf. 


(1.6) 


Then 


/^^(/c°) = ^log [(27r)<^(detC) 


+ l^trC-i#c :^a^{C). 


(1.7) 


1.3. The following theorem was proven in m- 


Theorem 1.1 (The WKFI; cf. [TT], Theorem 3.2). Given Ai, A 2 G [0,1] with Ai + A 2 = 1 and 
(strictly) positive-definite Ci, C 2 , set: C = AiCi + A 2 C 2 and ^ = Ai^Ci + ^ 2^02 ~ ^C- 
Assume that, ofr a given WF fi, 


Ai(a(Ci) + A2 o(C 2) ~ o(C) > 0, 


Aio(Ci) + A 2 o(C 2 )-o(C) log (27r)^(detC) 


+ tr C 




< 0. 


Then 


( 1 . 8 ) 


cr^(AiCi + A 2 C 2 ) - Aicr^(Ci) - A 2 cr^(C 2 ) > 0; equality iff A 1 A 2 = 0 or Ci = C 2 . (1.9) 


Observe that when ()){x) = 1, bounds fll.Sp are fulfilled for all choices of and A^, a = 1, 2. 
(In fact, they become equalities.) In this case, inequality (11.91) coincides with (II.2F 

2 Exponential weight functions 

2.1. As was said, in this paper we deal with exponential WFs, of the form 

(/)(xf) = exp where tf G M^. (2.1) 

To shorten the notation, we write from now on x and t instead of x.f and if. Here we use the 
Laplace transform formulas: for fifx) = exp Eqn (11. 6p yields 

(a(C) = exp and $c = Cexp (2.2) 

Hence, for (/)(x) = exp , the WDE (II. 7p becomes 

<Tt(C) = /i(/c°)exp (^t^Ct) (2.3) 
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where /i(/c°) is as in Eqn fll.4p . 

To tackle condition fll.Sp . we introduce the set S = S(Ci, C 2 ; Ai, A 2 ) C 

S = |t G > 0, and < o| where 

Ac^exp — exp and 


a=l,2 




^ Ac, exp - exp 

a=l,2 


log 


(27r)^(detC) 


+ A^exp ^^t^Cct^tr [C ^Cc] — dexp 


a=l,2 


(2.4) 


Theorem o is transformed into Theorem O 

Theorem 2.1 Given positive definite matriees Ci, C 2 and Ai, A 2 G [0,1] with Ai + A 2 = 1; set 
C = AiCi + A 2 C 2 . Assume that t G S. Then 

^(/c°)exp - Ai/i(/c°)exp - A 2 /i(/c°) exp >0; (2.5) 

equality iff A 1 A 2 = 0 or Ci = C 2 . 


2.2. Consequently, V t G S we have that 


exp ( -t^Ct 


i;(t) := |log |^(27re)^(det C) | ( 

Tj Vjlog (27re)'^(detCa) j exp > 


a=l,2 


(2.6) 


Eqn fl2.6p can be called an exponentially weighted (or briefly: an exponential) Ky Fan inequal¬ 
ity (EKFI). When t = 0, the EKFI is reduced to the standard KFI (II.ip . (II.2p . In particular, 
for t = 0, the bounds in (12.4p become equalities for any and A^. (Hence, point t = 0 
lies in S for any choice of and A^.) Consequently, it makes sense to analyze the difference 
A(t) :=E(t)-E(0): 


A(t)(= A(t; Cl, C2; Ai, A2)) = log (det C) 


exp 




a=l,2 

+(ilog(27re) 


1 - exp ( ^t^C„t 


+ y] Aalog(detC, 

T Vexp (lUCat^ +exp (It^Ct^ 


(2.7) 


a=l,2 


If A(t) > 0, we can think of an improvement in the standard KFI, and if A(t) < 0, of a 
deterioration. 
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2 . 3 . It has to be said that set S looks rather involved, and there is no guaranty that it 
is not empty. (In fact, numerical evidence suggests that S = 0 for some choices of and A^.) 
Therefore, it makes sense to explore the behavior of A(t) for t in the whole of In particular, 
a stationary point t satisfying grad^A = 0 is found from 


0 = (ilog(27re) 


AojCo^texp — Ctexp 


a=l,2 


+ ^ Aalog(detCa)Cat 

a=l,2 


exp 




-log (detC)Ct 


exp 


or 


t = A^C-^C^t exp -tT(C,-C)t 


a=l,2 


dloge + log [(27r)^(det Cc^)] 


( 2 . 8 ) 


(2.9) 


dlog e 

An obvious solution is t = 0; we are tempting to suggest that it is unique. To analyze the 
character of this point, let us take the second gradient: 


V?tA(t) = dlog(27re) 


A„(c« + c«tncyexp (Inc«t 


a=l,2 


— (C + Ctt^C^) exp 


+ ^ Ac,log(detCc,)(Cc, + Cc,tt^C^)exp 
a=l,2 

— log (det C)(C + Ctt^C^) exp 


At t = 0 it yields 


V?tA(t) 


t=o 


Aclog(detCc)Cc-log(detC)C. 

a=l,2 


( 2 . 10 ) 


( 2 . 11 ) 


For a local minimum at the origin t = 0 we need the matrix log (det Cc^)Cq.— log (det C)C 

a=l,2 

to be positive definite. In other words, the following property emerges, featuring reduced con¬ 
vexity of the map C C log det C: for given positive definite Ci, C 2 , 


A G [0,1] ^ [ACi + (1 - A)C 2 ] log det [ACi + (1 - A)C 2 ] 

is a convex matrix-valued function. 


( 2 . 12 ) 


Again, we can say that our numerical evidence suggests that t = 0 can be a local extremum (of 
either type) or a saddle point. 


5 














2.4. Let us check the status of the origin in the case o? = 1. Here t = t G M, and = c^, 
a = 1,2, and C — c — AiCi + A 2 C 2 are scalars. Further, function A(t) from Eqn fl2.7|) has the 
form 


A(t) = log(27re) 

F! Vexp 



a=l,2 



ol=1,2 



(1 ^2 

\ 

^a) 

exp 

)-l 


- (log c) 


exp 



1- 

T—1 

1 

V2 ) 



Next, functions and from Eqn fl2.4l) become 




= E Vexp 

- exp (id") 

a=l,2 


= AaCXp (^Cat' 

0 - (§‘=*'‘ 


a=l,2 


a=l,2 

Correspondingly, for set S we obtain: 

'1 


+ ^ Ac, exp 


c — exp 


log (27rc) 


S = < t G M : Ai exp 


A2(ci - C2)t 


Ai exp 


-A2(ci - C2)t^ 


+ A 2 exp 


+ A2exp -Ai(c2-ci)f^ > 1, 

^Ai(c2 - ci)t‘^ - 1 j log (27rc) 


, AiCi 

H-exp 


-A2(ci — C2)t^ 


, A 2 C 2 
H-exp 


-Ai(c2 — Ci)t‘^ 


< 1 


It is can be seen that if 0 < c < l/(27r), set § is unbounded. 

d 

The stationary point is where —A = 0 or 


0 = t log(27re) 


AaC^exp -cexp (^^cr 


a=l,2 


(2.13) 


(2.14) 


(2.15) 


+t y] AQ,CQ,(log CQ,)exp -c(log c)exp 

a=l,2 

Clearly, t = 0 is a solution. Next, we calculate the second derivative ttwA at t = 0. A general 

OF 

expression is: 

dt^ 

+ 


log(27re) 

F! AaCa(l + Cat^)exp (Lat^ 

^ — c(l + ct^) exp 


a=l,2 



Fj VCa(l + cF){\og Ca) exp - c(l + ct^)(log c) exp 


a=l,2 
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Taking into account that ^aCa = c, we obtain that 


a=l,2 




= ^ AaCa(l0g Ca) - clog C > 0. 

t=0 a=l,2 


(2.16) 


Inequality fl2.16p holds since x xlog x is a convex function for x > 0. 


3 Numerical results 


within chosen ranges of argument t = 


around point t = 0. Matrices are taken in the 


Performing numerical simulations, we show the graph of function t A(t) in Eqn (12.7p for d = 2 

^ h ^ 

form I ^ I where > 0 and \pa\ < 1. 

V Pa^a ) 

The simulations show the behavior of function A(t) for chosen matrices Ci and C 2 and values 

( ti\ 

of Ai = A and A 2 = 1 — A over selected ranges of the argument t = I 1 and indicate the set S. 

\t2 ) 

The plots exhibit a variety of possible patterns: positive and negative values of A(t), convexity, 
concavity, global/local minimum/maximum, as well as a saddle point, at t = 0 . (Recall, A > 0 
has been proposed as an improvement whereas A < 0 as a retrogression of a standard KFI.) 



1 



A: ai = 0.1, 02 = 0.1001, p, =-0.666, X= 0.99 


B: Oi = 1, 02 = 1.06, Pi = 0.5, p, =-0.2, A.= 0.01 


Figure 3.1 


In Fgures 3.1 - 3.5, set S - when it is non-empty - is shown in a grey color at a bottom 
horizontal plane. (The level at which this plane is placed has been selected for presentational 
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convenience only.) We want to note that in some examples the origin t = 0 seems to be an 
isolated point in S: it may be a consequence of the fact that the WF (/)(x) = exp (t^x) is 
unbounded for t ^ 0. 

In Figures 3.1. A and 3.1.B the graph of function A(t) lies above the value 0 attained at t = 0. 
This suggests that t = 0 is a global minimum of A(t). Consequently, the matrix in the RHS of 
Eqn fl2.1ip is positive definite for the specified choices of Ci, C 2 and A. 

Apparently, the EKFI holds true far beyond S and yields an improvement of the standard 


KFI. Matrix Ci in Figure 3.1.A is of the form Ci = 


° Cnd C 2 = ' " 2 ' 


0 




ajp2 




The values of A = Ai are chosen to be 0.99 on Figure 3.1.A and 0.001 on Figure 3.I.B. 




The plots above give an impression of set S in Figure 3.1.A showing how our perception 
changes when the range of variables ti and ^2 increases. In this example, the set (shown in the 
























grey color) is, obviously, unbounded. (The hyperbola-type curves are used to provide a geometric 
reference.) On the other hand, the plots below demonstrate that set S in Figure 3.1.B is bounded. 



Next, Figure 3.2 shows a more complex character of behavior. Here t = 0 is, apparently, 
a saddle point for the graph of A(t). Function A(t) takes both positive and negative values. 
However, over set S the EKFI yields an improvement of the standard KFL Here matrices Ca = 

I I, a = 1,2, and Ai = A 2 = A = 1/2. 

V ) 

Further, a similar pattern of behavior is confirmed on Figures 3.3.A and 3.3.B, with the same 
form of matrices Cq., and with Ai = A = 0.99 and 0.5, respectively. 
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Figure3^: = 0.1, 02 = 0.105, pj =-0.5, p, =-0.2, X= 0.5 





A: Oi = 4, ©2 = 4.1, pj = 0.333, P2 = 0.2, A.= 0.99 



Figure 3.3 

In Figure 3.4 we see an example where S is non-empty, and A(t) < 0 for some t G S. In other 
words, this is an example where the the EKFI holds true but does not yield an improvement rela¬ 
tive to the standard KFI. In this example, matrix Ci = 

The value A = Ai = 0.99. 




and Co = 


^2 

(^2P2 


alp2 

<^2 


10 
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^2 

Figure 3.4: Oj = O.L a, = 0.102, p, = O.OL X= 0.99 



A: oi = 3, 02 = 5, Pi = 0J33, = 0.2, X= 0.5 



B: oi = 3.08, ©2 = 4.05, pj =-0.666, p^ =-0.2, X= 0.5 


Figure 3.5 


Finally, in Figures 3.5 the set S is empty. Consequently, the EKFI fails (within the indicated 
range of argument t). Also, function A in Figure 3.5.A takes negative values for t ^ 0 within 
the indicated range but A in Figure 3.5.B takes both negative and positive values within the 

f a? a?Pi \ ( (Jo o'2P2 \ 

indicated range. The matrices are Ci = o C 2 — \ ^ 9 and 

V ^iPi J V ^2^2 cri2^ J 

Ai = A 2 = A = 1/2. The difference between these figures is that in Figure 3.5.A t = 0 is a 
maximum of A (within the depicted range of t) whereas in Figure 3.5.B the maximum is at the 
corner points. 
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